Population-genetic nature of copy number variations in the human genome

Authors

  • Mamoru Kato
  • Takahisa Kawaguchi
  • Shumpei Ishikawa
  • Takayoshi Umeda
  • Reiichiro Nakamichi
  • Michael H. Shapero
  • Keith W. Jones
  • Yusuke Nakamura
  • Hiroyuki Aburatani
  • Tatsuhiko Tsunoda
Abstract

Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population, including CNVs previously missed because they are small or reside in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilobase scale, suggesting that simple genetic analysis is applicable to these loci. We used probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward lower frequencies of CNV alleles and that most of our CNVs were explained by zero-, one- and two-copy alleles alone. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies is applicable to common deletion CNVs, but direct interrogation is needed for other types of CNVs.
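
As an illustration of the CNV-SNP LD measure discussed in the abstract, the following is a minimal sketch of the standard pairwise r² statistic computed from phased haplotypes. It is not the authors' pipeline: the function name `r_squared`, the 0/1 allele encoding (1 for the deletion allele and for the SNP minor allele), and the toy haplotypes are all assumptions made for this example.

```python
from collections import Counter

def r_squared(haplotypes):
    """Pairwise LD (r^2) between a bi-allelic CNV and a SNP.

    haplotypes: list of (cnv_allele, snp_allele) tuples, each allele coded
    0 or 1 (hypothetical encoding: 1 = deletion allele / SNP minor allele).
    Returns NaN when either locus is monomorphic.
    """
    n = len(haplotypes)
    counts = Counter(haplotypes)
    # Marginal allele frequencies at each locus
    p_a = sum(c for (a, _), c in counts.items() if a == 1) / n
    p_b = sum(c for (_, b), c in counts.items() if b == 1) / n
    # Frequency of the haplotype carrying both "1" alleles
    p_ab = counts[(1, 1)] / n
    d = p_ab - p_a * p_b  # classical disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")
    return d * d / denom

# Toy data (hypothetical): the deletion always travels with the SNP allele
perfect = [(1, 1)] * 3 + [(0, 0)] * 7
# Toy data (hypothetical): the two loci are statistically independent
independent = [(1, 1), (1, 0), (0, 1), (0, 0)]

print(r_squared(perfect))
print(r_squared(independent))
```

In this sketch a SNP that perfectly tags the deletion gives an r² at or very near 1, while independent loci give 0. Multi-allelic CNVs would need a different statistic and are not covered here.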

Similar resources

O-27: Genome Instabilities in Preimplantation Development Leading to Genetic Variation between Tissues of Normal Human Fetuses

Background: The origin of midlife copy number variations (CNVs) between tissues in non-genetic diseases is unknown. Such genomic differences are caused by post-zygotic events. They might arise either during life or from prevalent mosaicism at the preimplantation stage. We aim to explore fetal mosaicism and its origins. Materials and Methods: Two apparently normal fetuses were achieved following the ...

O-38: Concurrent Whole-Genome Haplotyping and Copy-Number Profiling of Single Cells

Background: Methods for haplotyping and DNA copy-number typing of single cells are paramount for studying genomic heterogeneity and enabling genetic diagnosis. Before analyzing the DNA of a single cell by microarray or next-generation sequencing, a whole-genome amplification (WGA) process is required, but it substantially distorts the frequency and composition of the cell’s alleles. As a conseque...

I-44: Concurrent Whole-Genome Haplotyping and Copy-Number Profiling of Single Cells

Background: Methods for haplotyping and DNA copy-number typing of single cells are paramount for studying genomic heterogeneity and enabling genetic diagnosis. Before analyzing the DNA of a single cell by microarray or next-generation sequencing, a whole-genome amplification (WGA) process is required, but it substantially distorts the frequency and composition of the cell’s alleles. As a conseque...

I-37: Establishing High Resolution Genomic Profiles of Single Cells Using Microarray and Next-Generation Sequencing Technologies

The nature and pace of genome mutation are largely unknown. Standard methods to investigate DNA mutation rely on arraying or sequencing DNA from a population of cells; hence the genetic composition of individual cells is lost and de novo mutation in cell(s) is concealed within the bulk signal. We developed methods based on (SNP-) arraying and next-generation sequencing of single-cell whole-genom...

Applications of multiplex ligation-dependent probe amplification (MLPA) method in diagnosis of cancer and genetic disorders

Introduction: Many human diseases and syndromes result from partial or complete gene deletions and duplications or from changes in specific chromosomal sequences. Various methods are used to study chromosomal aberrations, including Comparative Genomic Hybridization (CGH), Fluorescent in Situ Hybridization (FISH), Southern blots, Multiplex Amplifiable Probe Hybridisation (MAP...

BIO-132: Population Genetics of Human Copy Number Variations: Models and Simulation of their Evolution Along and Across the Genomes

Population genetic models play a significant role in human genetic research, since they promise to provide a better understanding of both the evolution of normal variations of the genomes and the development of disease-promoting genomic segments. Currently, since we are limited in our knowledge about human demographic history and variations of recombination and mutation rates, large-scale comput...

Journal title:

Volume 19, Issue

Pages -

Publication date: 2010